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Abstract. The Reflexive Game Theory (RGT) has been recently pro- 
posed by Vladimir Lefebvre to model behavior of individuals in groups. 
The goal of this study is to introduce the Inverse task. We consider meth- 
ods of solution together with practical applications. We present a brief 
overview of the RGT for easy understanding of the problem. We also de- 
velop the schematic representation of the RGT inference algorithms to 
create the basis for soft- and hardware solutions of the RGT tasks. We 
propose a unified hierarchy of schemas to represent humans and robots. 
This hierarchy is considered as a unified framework to solve the entire 
spectrum of the RGT tasks. We conclude by illustrating how this frame- 
work can be applied for modeling of mixed groups of humans and robots. 
All together this provides the exhaustive solution of the Inverse task and 
clearly illustrates its role and relationships with other issues considered 
in the RGT. 
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1 Introduction 

The Reflexive Game Theory (RGT) has been entirely developed by Lefebvre [HIS] 
and is based on the principles of anti-selfishness or egoism forbiddeness |2 [2] 
and human reflexion processes 3;- Therefore RGT is based on the human-like 
decision-making processes. The main goal of the theory is to model behavior of 
individuals in the groups. It is possible to predict choices, which are likely to be 
made by each individual in the group, and influence each individual's decision- 
making due to make this individual to make a certain choice. In particular, the 
RGT can be used to predict terrorists' behavior 4J. 

In general, the RGT is a simple tool to predict behavoir of invididuals and 
influence individuals' choices. Therefore it makes possible to control the individ- 
uals in the groups by guiding their behavoir (decision-making, choices) by means 
of the corresponding influences. 
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On the other hand, now days robots have become an essential part of our hfe. 
One of the purposes robots serve to is to substitute human beings in dangerous 
situations and environments, hke defuse a bomb or radioactive zones etc. 

In contrast, human nature shows strong inchnations towards the risky be- 
havior, which can cause not only injuries, but even threaten the human life. 
The list of these reasons includes a wide range starting from irresponsible kids' 
behavior to necessity to find solution in a critical situation. In such a situation, 
a robot should full-fill a function of refraining humans from doing risky actions 
and perform the risky action itself, if needed. 

However, robots are forbidden and should not physically force people, but 
must convince people on the mental level to refrain from doing a risky action. 
This method is more effective rather than a simple physical compulsion, because 
humans make the decisions (choices) themselves and treat these decisions as 
their own. Such technique is called a reflexive control [3]. 

The task of finding appropriate refiexive control is closely related with the 
Inverse task, when we need to find suitable influence of one subject on another 
one or on a group of subject on the subject of interest. Therefore, it is needed 
to develop the framework of how to solve the Inverse task. This is the primary 
goal of this study. 

However, for better understanding of the gist of the Inverse task and its 
intrinsic relationships with other issues of the RGT, we introduce the entire 
spectrum of the tasks, which can be solved by the RGT. This forms the scope 
of inference algorithms used in the RGT. We present the RGT algorithms in 
the form of the schemas of control systems that can be instantly applied for 
developement of soft- or/and hardware solutions. We develop a hierarchy of 
control systems for abstract individual (including human subject) and robotic 
agent (robot) based on these control schemas. Finally, we illustrate application of 
the Inverse task together with other RGT inference algorithms to model robot's 
behavior in the mixed groups of humans and robots. 

2 Brief Overview of the Reflexive Game Theory (RGT) 

2.1 Representation of groups: graphs, polynomials and stratification 
tree 

The RGT deals with groups of abstract subjects (individuals, humans, au- 
tonomous agents etc). Each subject is assigned a unique variable {subject vari- 
able). Any group of subjects is represented in the shape of fully connected graph, 
which is called a relationship graph. Each vertex of the graph corresponds to a 
single subject. Therefore the number of vertices of the graph is in one-to-one 
correspondence with overall number of subjects in the groups. Each vertex is 
named after the corresponding subject variable. 

The RGT uses the set theory and the Boolean algebra as the basis for calcu- 
lus. Therefore the values of subject variables are elements of Boolean algebra. 
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All the subjects in the group can have either alliance or conflict relationship. 
The relationships are identified as a result of group macroanalysis. It is suggested 
that the installed relationships can be changed. The relationships are illustrated 
with graph ribs. The solid-line ribs correspond to alliance, while dashed ones 
are considered as conflict. For mathematical analysis alliance is considered to be 
conjunction (multiplication) operation (•), and conflict is defined as disjunction 
(summation) operation (+). 

The graph presented in Fig. [T^ or any graph containing any sub-graph isomor- 
phic to this graph are not decomposable. In this case, the subjects are excluded 
from the group one by one, until the graph becomes decomposable. The exclusion 
is done according to the importance of the other subjects for a particular one 
[Tl[2]. Any other fully connected graphs are decomposable. Any decomposable 
graph can be presented in an analytical form of a corresponding polynomial. Any 
relationship graph of three subjects is decomposable (see [Illl]). 

Consider three subjects a, b and c. Let subject a is in alliance with other 
subjects, while subjects b and c are in conflict (Fig. [T|d). The polynomial corre- 
sponding to this graph is a{b + c). 



a c a a 




Fig. 1. The relationship graphs. 



[b] + [c] 
[a] ■ [b+c] 




[a(b+c)] 

Fig. 2. Polynomial Stratification Tree. Polynomials [a], [b] and [c] are elementary poly- 
nomials. 

Regarding a certain relationship, the polynomial can be stratified (decom- 
posed) into sub-polynomials [U [2] . Each sub-polynomial belongs to a particular 
level of stratification. If the stratification regarding alliance was first built, then 
the stratification regarding the conflict is implemented on the next step. The 
stratification procedure finalizes, when the elementary polynomials, containing 
a single variable, are obtained after a certain stratification step. 

The result of stratification is the Polynomial Stratification Tree (PST). It 
has been proved that each non-elementary polynomial can be stratified in an 
unique way, i.e., each non-elementary polynomial has only one corresponding 
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PST (see [7] considering one-to-one correspondence between graphs and polyno- 
mials) . Each higher level of the tree contains polynomials simpler than the ones 
on the lower level. For the purpose of stratification the polynomials are written 
in square brackets. The PST for a{b + c) polynomial is presented in Figj2j 

Next, we omit the branches of the PST and from each non-elementary polyno- 
mial write in top right corner its sub-polynomials. The resulting tree-like struc- 
ture is called a dia(;ona//orm[ll[2l[5l[6]. Consider the diagonal form correspond- 
ing to the PST in Fig. [2j 

[b] + [c] 

[a][b + c] 

[a{b + c)] 

Hereafter, the diagonal form is considered as a function defined on the set of 
all subsets of the universal set. The universal set contains the elementary actions. 
For example, these actions are actions a and f3. By definition, the Boolean algebra 
of the universal set includes four elements: 1 = {a, /?}, {a}, {/?} and the empty 
set = {} = 0. These elements are all the possible subsets of universal set and 
considered as alternatives that each subject can choose. The alternative = {} 
is interpreted as an inactive or idle state. In general. Boolean algebra consists of 
2" alternatives, if universal set contains n actions. 

Accroding to definition given by Lefebvre [5], we present here exponential 
operation defined by formula 

= P + W , (1) 
where W stands for negation of [D [21 E] ■ 

This exponential operation is used to fold the diagonal form. During the 
folding, round and square brackets are considered to be interchangeable. The 
following equalities are also considered to be true: x + x = l,x + = x and 
a; -|- 1 = 1. Next we implement folding of diagonal form of polynomial a(b + c): 

[b] + [c] 

[a][b + c] [a]{[b + c] + [b] + [c]) 

[a{b + c)] ^[a{b + c)] =a{b + c) + a. 

It is considered that the levels of the PST represent different processing levels 
of natural or artificial cognitive system. Each level is considered as an images. 
The root of the tree is the input into the cognitive system and, therefore can be 
considered as the image of the world (environment including self and others), 
perceived by the subject. 

As it follows from the PST, there is a hierarchy of images, corresponding 
to a particular cognitive level. During processing along this hierarchy in the 
bottom-up manner, the image on the lower level undergoes an extensive process 
of simplification by the means of decomposition into simpler parts on the higher 
level. These parts are considered to be the images of the image on the previous 
level. Therefore, the images on the second level are different representions of the 
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original image of the world. This procedure repeats until we obtain elementary 
part (elementary polynomials) [TJ H] . 

On the other hand, the PST folding procedure can be referred as top-down 
intergration process of simpler images from the higher levels. 

Therefore, the stratification procedure of original polynomial together with 
the folding procedure of the diagonal form illustrate the interplay of bottom-up 
and top-down information processes, which are widely imployed in biological 
[HI m Uni E] and artificial [TH [131 [H] information processing systems. The idea 
of hierarchical structure is highly coherent with hierarchical organization of ma- 
jority of natural (inanimate objects) and biological (living creatures) entities. 
Furthermore, it has been shown that hierarchical structure is intrinsic for the 
relationships in societies of insects [15] , animals [171 UHl [IHj and human beings. 

Therefore hierarchical representation of the groups in the form of PST corre- 
spond to extraction of the hierarchical structure of the given group, while fusion 
of the PST and its diagonal form with diagonal form folding procedure closely 
resembles the way of information processing within a single independent congni- 
tive system as discussed above. Thus, RGT imploys the fundamental principles 
of hierarchical organization on both group (reflects structure of the groups) and 
individual (illustrates information processing within independent cognitive sys- 
tem of a single unit) levels. This makes RGT universal tools that mildly bridges 
the gap between representation and analysis. 

2.2 The Decision Equation: definition and solution 

The goal of each subject in a group is to choose an alternative from the set of 
alternatives under consideration. To obtain choice of each subject, we consider 
the decision equations, which contain subject variable in the left-hand side and 
the result of diagonal form folding in the right-hand side: 

a = {b + c)a + a 
b = {b + cja + a 
c = (5 + cja + a 

To find solution of the decision equations, we consider the following equation: 

Ax + Bx , (2) 

where x is the subject variable, and A and B are some sets. Eq.([2]) represents 
the canonical form of decision equation. This equation has solution if and only 
if the set B is contained in set A: A ^ B. If this requirement is satisfied, then 
eq.([2| has at least one solution from the interval A x ^ B [3|. Otherwise, the 
decision equation has no solution, and it is considered that subject cannot make 
a decision. In such situation, the subject is in frustration state. 

Therefore, to find solutions of decision equation, one should first transform 
it into the canonical form. Out of three presented equations only the decision 
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equation for subject a is in the canonical form, while other two should be trans- 
formed. We consider explicit transformation only of decision equation for subject 
b HO]: 

a{b + c) +a = ab + ac + a ^ ab+ {ac + a)b+ {ac + a)b = {a + a + ac)b + {ac + a)b = 
(1 + ac)b + (ac + a)b = b + [ac + a)b = b + {ac + ac + a)b = + (c + a)b. 

Therefore, 

b = b+[c + a)b. (3) 

The transformation of equation for subject c be can be easily derived by 
analogy: c = c + (6 + a)c. 

Next we consider two tasks, which can be formulated regarding the decision 
equation in the canonical form and provide methods to solve each task. 

2.3 The Forward Task 

The variable in the left-hand side of the decision equation in canonical form is 
the variable of the equation, while other variables are considered as influences 
on the subject from the other subjects. The Forward task is formulated as a task 
to find the possible choices of a subject of interest, when the influences on him 
from other subjects are given. 

After transformation of arbitral decision equation into its canonical form, 
the sets A and B are functions of other subjects' influences. For example, if we 
consider group of subjects a, 6, c, etc. togehter with the abstract representation 
of decision equation in canonical form for subject a, the sets A and B will be 
the functions of subject variables 6, c, etc. : 

A{b,c,...)a + B{b,c,...)a . (4) 

In the case of only three subjects a, b and c, c, ...) = A{b,c) and 
B(6,c,...) =B(5,c). 

All the influences are presented in influence matrix (Table [T]). The main 
diagonal of influence matrix contains the subject variables. The rows of the 
matrix represent influences of the given subject on other subjects, while columns 
represent the influences of other subjects on the given one. The influence values 
are used in decision equations. 

Table 1. Influence Matrix 





a 


b 


c 


a 


a 


{a} 




b 


m 


b 
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c 


m 




c 
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For subject a: a ~ ({/?} + {(3})a + a a = {/3}a + a. 

For subject b:b^b+ ({a}{^} + {a})b ^b^b + {p}b. 

For subject c: c = c + ({/3}{/3} + {l3})c ^ c = c + ({/3} + {a})c ^ c = 1. 

Equation for subject a does not have any solutions, since set A = A{b, c) = 
{/?} is contained in set B = B{b,c) — 1: A C B. Thus, subject a cannot make 
any decision. Therefore he is considered to be in frustration state. 

Equation for subject b has at least one solution, since A = A{b, c) = 1 = 
{a,/3} 2 B = B{b,c) = {/?}. The solution belongs to the interval l^bD {/?}. 
Therefore subject b can choose any alternative from Boolean algebra, which 
contains alternative {/?}. These alternatives are 1 = {a,f3} and {/?}. 

Equation for subject c turns into equality c = 1. This is possible only in the 
case, when A{b, c) = B{b, c). Here A = B = 1. 

2.4 The Inverse Task 

In contrast to the Forward task, the Inverse task is formulated as a task to 
find all the simultaneous (or joint) influences of all the subjects together on the 
subject of interest that result in choice of a particular alternative or subset of 
alternatives. We call the subject of interest to be a controlled subject. 

Let subject a be a controlled subject and a* is a fixed value, representing an 
alternative or subset of alternatives, which subjects b, c, etc. want subject a to 
choose. We call value a* to be a target choice. By substituting subject variable a 
with fixed value a* , we obtain the influence equation. If we substitute the subject 
variable a with fixed value a* in the canonical form of the decision equation (eq. 
Q), we obtain the canonical form of the influence equation: 

a* = A{b,c,...)a* + B{b,c,...)^ , (5) 

For only three subjects a, b and c, A{b,c,...) = A{b,c) and B{b,c,...) = 
B{b,c). 

In contrast to the decision equation, which is equation of a single variable, 
the influence equation is the equation of multiple variables. However, the number 
of variables of influence equation is not trivial question. In fact, the number of 
variables in influence equation can be less then {n — 1), where n is the total 
number of subjects in the group. There are groups, in which sets A and B are 
functions of less than {n—1) variables (see Appendix|A]). Therefore the variables 
that present in influence equation are called effective variables. 

The Inverse task is by definitiorj^ formalized as to find all the joint solutions 
of all subjects in the group, except for the controlled one, when the target choice 
is represented by interval xi 12 ci* 5 X2i where xi X2 are some sets and 
Xi 3 X2- In such a case, to solve the Inverse task, one should solve the system 
of influence equations: 

^ We need a system of influence equations because solutions of the influence equation 
a* = A{b, c, ...)a* + B{b, c, ...)a* itself only guaratee that the original decision equa- 
tion a — A{b, c, ...)a + B{b, c, ...)a turns into true equality, but it is not guaranteed 
that these solutions are the only ones that turn decision equation into true equality. 
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(Aib,c,...) = xi (6) 
lB(5,c,...) = X2 (7) 

If the target choice is a single alternative, then xi = X2 = o-* ■ 

The solutions of the system ([^[^ are considered as reflexive control strategies. 

The solution of the Inverse task in particular is characterized from two points. 
The first point is whether it is required to find the influence of a particular single 
subject or joint influences of a group of subjects. The second one is whether the 
target choice is represented as a single alternative or as an interval of alternatives. 

To illustrate these points, we introduce a particular group of subjects. Let 
subjects a and b are in alliance with each other and in conflict with subject 
c. The polynomial corresponding to this graph is ah + c. The diagonal form 
corresponding to this polynomial and its folding is 

[a][b] 
[ab] +[c] 
[ab + c] = ab + c 

Therefore the decision equation for all the subjects in the group is 

X ^ ab + c, (8) 

where x can be any subject variable a, b or c. 

Influence of a single subject vs joint influences of a group. First we consider 
example, when the influence of a single subject is required. Let subject b makes 
influence {a} and a* = {a}. Then we need to find influences of a single subject 
c, which result in solution a* = {a} of decision equation a = ab + c. 

The canonical form of this influence equation is a* ~ (b + c)a* + ca* . Since 
a* — {a}, Xi = X2 = {a}, we obtain a system of equations: 



{a} + c^{a} (9) 
c = {a} (10) 

Therefore, the straight forward solution of this system is c = {a}. 

This simple example illustrates the very gist of the Inverse task - to find the 
appropriate influences, which result in target choice. 

Next, we consider that influence of subject b is not known. Therefore, we 
obtain system 

'b + c^{a} (11) 
c = {a} (12) 

In this case, we need to find the values of variable b, which together with 
c, result in solution a* = {a}. In other words, we need to find all the pairs 
(6, c), resulting in solution a* — {a}. These pairs are solutions of the system 
([ll]|l2]). Therefore, we run all the possible values of variable b and check if the 



first equation of the system ( TI]|l2 ) turns into true equality: 
6 = 1 : l + {a} = 1 ^ 1 7^ {a}; 
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b = {a} : {a} + {a} = {a} ^ {a} =■ {q}; 
6 = {/?} : {/?} + W = 1 1 ^ {a}; 
6 = : + {a} = {a} =4> {a} = {a}. 

Therefore, out of four possible values of variable 6, only two values {a} and 
arc appropriate. Thus, wc obtain two pairs c): ({a}, {a}) and ({aj.O). 

A single target alternative vs interval of alternatives. In the previous examples 
we considered a target choice to be only a single alternative. Here we illustrate 
the case, when a target choice is an interval. Let b — {/?}, and 1 2 a* 15 {a}. To 
find corresponding influences of subject c, we solve the system of equations: 



Again, we instantly obtain the sohition of this system: c — {a}. 

In this section, we have formulated the Inverse task in general and considered 
its particular formalization depending on the number of influences and what is 
the target choice. However, we do not have a method to solve arbitral influence 
equation. Therefore, we solve this problem in the next section. 

3 How to Solve an Arbitral Influence Equation 

As an introduction for this section, we consider the fundamental proposition, 
which will be the Conner stone to solve the influence equations. 

Proposition 1. Let P and Q be some abstract sets. Then PQ + PQ = P = 




(13) 
(14) 



Q. 



Proof. Necessity. Let PQ + PQ = 0, then 



PQ + PQ = Q=>PQ + PQ + P = P^P + PQ = P^ 
P{Q + Q) + PQ = Q + PQ + PQ = P ^ Q = P. 



Therefore ii PQ + PQ = 0, then P = Q. 

Sufficiency. Let P = Q, then PP + PP = 0.D 



Now let us consider the new type of equation: 



Aix + Bix = 



(15) 



This equation has solution if and only if Ai ^ x D Bi. 



10 Sergey Tarasenko 

3.1 Solving Influence Equations 

There are three operations defined on the Boolean algebra. They are conjunc- 
tion (• or multiplication), disjunction (+ or summation) and negation {x, where 
X is subject variable). The negation operation is unary operation, while other 
two operations are binary. Using combination of these three operations, we can 
compose any influence equation. Since, it is obvious how to solve the equation 
including only unary operation, we discuss how to solve influence equations in- 
cluding a single binary operation. 

For this perpose, we consider two abstract subject variables xi and X2 and 
abstract alternative x- 

Lemma 1. The solution of equation 

X1+X2 = X (16) 

regarding variable Xi, where i = 1,2, is given by the interval x 3 3 ix^j + 
xjx), where j = 1, 2; j 7^ i. 

Proof. According to Proposition 1, P = xi + X2, Q ~ Xt P ^ ^ ^2 — ^ 
and Q = x- _ _ 

Therefore, PQ + PQ = [xi + X2)x + ^ = ^iX + X2X + ^ x^X- Conse- 
quently, we obtain eq.(17): 

a;iX + a;2X + ^X^ = (17) 



We solve eq. ( 17 1 regarding variable Xi . First, we transform cq. (171 into canon- 
ical form: 

XXi + {XX2 + XX2)xl = (18) 



Therefore, the solution of eq.(18) is given by the interval 

X^xi^{xx2+xix)- (19) 
Since variables Xi and X2 are interchangable and it is possible to solve eq.(|17[) 



regarding variable X2 as well, the general form of solution of eq.( 16 ) is the interval 

X^x,^ {xxj + xJx)- (20) 

where i = 1,2 and .7 = 1, 2; j =/= i.O 

Lemma 2. The solution of equation 

X1X2 = X (21) 

regarding variable Xi, where i = 1,2, is given by the interval (xxj +X Sj) ^ a;^ 3 
X, where j — 1, 2; j 7^ i. 
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Proof. According to Proposition 1, P — X1X2, Q = Xi P — X1X2 — xi + X2 and 

Q = x- _ _ ___ ___ 

Therefore, PQ + PQ = {xiX2)x + i^i + X2)x = 2^2X2^1 + ^iX + ^2X- 



Thus, we obtain eq.(22|: 



X2X^\ + ^\X. + ^iX = 



(22) 



We solve eq. ( 22 1 regarding variable x\ . First, we transform eq. ( 22 1 into canon- 
ical form: 

(xa;2 + XX2)x\ + y^xi = (23) 



Since xx2 + x^i = X^2 +X ^2, the solution of eq.(23) is given by the interval 
{XX2 + X ^) 3 2^1 3 X- (24) 



Since variables xi and X2 are interchangable and it is possible to solve eq.([22| 
regarding variable X2 as well, the general form of solution of eq.( 21 ) is the interval 

ixxj +Xxj)^x^Dx- (25) 
where i = 1, 2 and j — 1,2; j i.D 



Since one bound of the solution intervals for cqs. ( 16 ) and ( 21 1 are functions of 
the second variable, we need to run all the possible values of the second variable 
in order to obtain all possible solutions of these equations in the form of pairs 

{xi,X2)- 

Next we consider several examples, illustrating application of Lemmas [l] and 

m 

Example 1. For illustration, we solve equation a* ~ ba* +c. Consider x = a*, 
xi — ba* and X2 = c, we obtain the solution interval for variable X2 — c: 
X D c 3 (xx^ + X X^)- After simplfication, we get interval (26): 



X^c^xb 



(26) 



Next we consider examples with particular alternatives. Let it be alternative 
{a} : X = {q^}- The solution interval is then {a} 3 c 3 {a}b. Since the lower 
bound of this interval is a function of variable b, to find all solutions of equation 
a* = ba* + c, we calculate value of expression {a}b for all possible values of 
variable b (Table [2]). 

To reesure that solutions are correct, we check that decision equation a = 
ba + c turns into true equality for the obained pairs (&, c): 

{{a}, {a}): {a}{a} + {a} = {a} =^ {a} = {a} is true; 
{{a}, 0): {Q;}{a} + = {a} {a} = {a} is true; 
({/3}, {a}): {a}{/3} + {a} = {a} ^ {a} = {a} is true; 
(1, {a}): {a}l + {a} = {a} => {a} — {a} is true; 
(1, 0): {a}l + = {a} => {a} = {a} is true; 
(0, {a}): {a}0 + {a} = {a} =^ {a} = {a} is true. 
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So far, we have illustrated how to solve the influence equation. We as well 
showed that the pairs {b, c) obtained by solving equation a* = ba* + c in ac- 
cordance with Proposition 1 and Lemmas 1 and 2 are indeed solutions of this 
equation. 

Table 2. Solutions of the influence equation a* = ba* + c 
Values of b {a} 1 

Pairs (6,c) W) (1> W) (0, {a}) 

(W,0) (1,0) 



Example 2. We consider influence equation for subject b obtained from eq.([3]). 

{c + a)x + X = X (27) 



First, we transform the left-hand side of eq.(27|: 

(c -I- a)x + X — (^X + o^ + X = cx + ax+{c + a+ l)x = c + a + x- 



Therefore, eq.(27) can be rewritten as follows: 

c+a+x^X 



(28) 



Considering, xi = c and X2 = a + x, we instantly obtain the solution interval 



of eq.(128|: X 3 c D (x(a + x) + xia + x)) ^ X ^ c D {x a + xxa). 
Finally, 

X 3 c 3 X a 

Example 3. Next, we consider influence equation 

ab + x = X 



(29) 



(30) 



Considering, xi — ab and X2 = x? we instantly obtain the solution interval 
X^abD {xx + xx) or 

X^abDO (31) 



Therefore, in order to find all solutions of eq.(30), we need to solve the equa- 
tions 

ab = y (32) 

where y is any sub-set of set x (y 3 x)- 

Each equation can be solved according to Lemma |2] 

Example 4- As a final example, we again consider influence equation a* = 
{b + c)a* + ca* and show how application of Lemma [l] essentially simpliflcs its 
solution. We get the system of influence equations: 
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b + c~ {a} ; 
c = {a} . 



(33) 

(34) 



From this system we obtain a single equation: 



b + {a} = {a}. (35) 
According to Lemma [Tj we instantly obtain the solution interval of eq. ( 35 ) : 

{ajDbDO . (36) 



Thus, eq.(35l has two solutions: b = {a} and 6 = 0. Therefore the solution 



of system ( 33 



34| consists of two pairs ({a}, {a}) and (0, {a}). 



To conclude this section, we provide its brief summary. We have shown how 
to solve the Inverse task by means of influence equations. We have proved two 
fundamental lemmas, which allow to solve any influence equation regardless of 
the number of variables. Finally, we have illustrated several examples of how 
apply these lemmas. 



3.2 Analysis of Extreme Cases 1: Frustration 

In this section we analyze the situation, when subject can appear in frustration 
state, from the point of view of the inverse task. Let us consider the polynomial 



a(b + c) discussed in the section 2.1 The decision equation that corresponds to 
this polynomial is x = {b + c)a + a, where x can be any subject variable. 

Next we try to find all the pairs {b, c) such that result in selection of a 
particular alternative by subject a. 

The decision equation for subject a is a = (6 + c)a + a. The solution interval 
of this decision equation is6 + c3aI3 1. We need to check which alternative 
subject a can be convinced to choose. To do this, we consider the system of 
equation for each alternative. 

Alternative {a}: 

'b + c={a} (37) 
1 = {a} (38) 

Alternative {/3}: 

b + c = {/3} (39) 
1 = {/?} (40) 

b + c = (41) 
1 = (42) 

In these systems the second equation is incorrect equality. Therefore these 
systems have no solution. 
Alternative 1 — {a,/3}: 



Alternative = {}: 
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b + c=l 
1 = 1 



(43) 
(44) 



The second equation is correct equality. Therefore this system has solution. 

Thus, out of four possible alternatives, subject a actually can choose only 
alternative 1 = {a, (3}. To find solutions, resulting in selection of the alternative 
1 — {a,/?}, we need to solve only eq.(43), since eq.(44| turns into the true 
equality. 

According to Lemma [l] we instantly obtain the solution interval for eq.(43): 

IDbDc (45) 
We calculate the pairs (6, c) for all possible values of variable c (Table |3]). 



Table 3. Solutions of the influence equation b + c — 1 

Values of e {a} {fi} 1 

({/?}, W) ({a}, {/?}) (0,1) (1,0) 
Pairs (b,c) (l,{a}) (1, {/?}) ({a}, 1) 

({/3},1) 
(1,1) 



Therefore, the influence analysis of the decision equation a — {b+c)a-\-a shows 
that the only alternative that subject a can choose is alternative 1 = {a, /?}. The 
influence analysis provides us with the set (exhaustive list) of pairs (6, c) of joint 
influences resulting in selection of alternative 1 = {a,/3}. Therefore, if the pair 
of influences does not match any pair from this list, the decision equation has 
no solution and this results in frustration state. 

Summarizing, this section we note that in general there are two sets. The set 
D contains alternatives that a controlled subject can choose. The set U is the 
set of altertanives of the target choice. Therefore, the need to put subject a into 
frustration state emerges, if the target choice of a controlled subject cannot be 
made by this subject. In other words, we need to put a subject into frustration 
state, if D n U = 0. 

3.3 Analysis of Extreme Cases 2: What to do with Super- Active 
Groups 

Among all the possible groups, there are groups, in which subjects will always 
choose only the alternative 1 = {a,/?} regardless of the influence of other sub- 
jects. Such groups are called super-active groups. 
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Next we consider one special case of super active groups - the homogenous 
groups. The group is called homogenous^ if all the subjects in the group are 
connected with the same relationship. 

Here we provide proof of the lemma about homogenous groups originally 
formulated by Lefebvre [TJ ^ . 

Lemma 3. Any homogenous group is the super-active group. 

Proof. We consider the homogenous groups, where all the subjects are connected 
with alliance (alliance groups) and conflict (conflict groups) relationship, sepa- 
rately. 

Without loss of generallity, we suggest that there are n subjects ai, 02, a„. 

Alliance groups. The polynomial corresponding to the alliance group of n 
subject is aia2...a„. Next we construct the diagonal form and apply folding 
procedure: 

[oi][a2]...[a„] 

[0102. ..o„] = [aia2...a„] + [ai][a2]...[a„] = 1 . 

Therefore the alliance groups are always super-active. 

Conflict groups. The polynomial corresponding to the conflict group of n 
subject is oi -|- a2 -l- ... -I- a„. Next we construct the diagonal form and apply 
folding procedure: 

[fli] + [0-2] + ■•• + [an] 

[oi + 02 + ... + a„] = 

[oi + 02 + •■• + a„]+ [oi] + [02] + ... + [a„] = 1 . 

Therefore the conflict groups are always super-active. 

Since both the alliance and the conflict groups are super-active, this lemma 
is proved. □ 

However, there are non-homogenous super-active groups as well (see Ap- 
pendix |B]) . 

Summarizing this section, we note that subjects in the super-active groups 
cannot be controlled in their choices and the entire groups is uncontrolable. 
Therefore, once the super-active groups emerges, the only way to make it con- 
trollable is to change the relationships in the group. 

4 The Basic Control Schema of an Abstract Subject 
(BCSAS) in the RGT 

We have presented the detailed description of the RGT including solution of 
the Forward and Inverse tasks. We have also considered the extream cases of 
decisions like putting a subject into frustration state or changing structure of a 
super-active group. As a final stroke, we summarize all the presented material in 
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Fig. 3. The Block schema for extracting sets Dh and Z^- 



the form of Basic Control Schema of an Abstract Subject (BCSAS) in the RGT. 

The input comes from the environment and is formaUzed in the form of exter- 
nal Influences on the subject, the Boolean algebra of Alternatives and Structure 
of a Group. 

Information about the Influences, Boolean algebra and Group Structure is 

propagated into the Decision Module. The Decision Module implements sohition 
of the Forward task. Therefore the output set D of the Decision Module is the 
set of possible alternatives, which subject can choose under the given conditions. 

The information about Boolean algebra and Group Structure is propagated 
into the Influence Module. The Influence Module solves the Inverse task. The 
output set D/j of the Influence Module is the set of the pairs (x, Z^)^, where % is 
the target alternative, the set Z-^ is the set of all the joint influences, resulting in 
selection of the target choice; and x represents a subject variable. Each {x^Z^)^ 
represents a reflexive control strategy. 

Therefore, the decision to put a subject into frustration state is justified if 
it is impossible to make subject x choose the target alternative %, i.e., if for pair 
{x,Z-^)x set Z^ = {}, and subject x should not choose any other alternative 
except for the target one. 
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4.1 Schema for Iterative Algorithm to Obtain Output of the 
Influence Module 

The alternatives x with corresponding non-empty sets Zy- are included into the 
set Dft,. Here we introduce set "Lh to store the non-empty sets Z^. The schema 
of the algorithm for extracting sets D^j and Z/j is presented in Fig. [3j First the 
sets and Z/j are empty: = {} and Z/i = {}. The algorithm reads the set of 
pairs (x, 2^)^ and stores it in array Pairs{M), where M is a counting variable, 
A'' is the total number of pairs. Then it is checked for each pairs from array 
Pairs whether set Z^^ is empty: —~ {}? . If 'yes', the algorithm increments 
counting variable M{M = M + 1) and proceeds to the next pair from array 
Pairs. If 'no', then alternative x is included into the set D/j(D;j = D/j -t- x), set 
D/i is saved, the set Z^ is included into set Zh {Zh = Zh + Z^) and set Zh is 
saved. The process is run while M < N. 

In this iterative algorithm, we separately store the alternatives x ? which can 
be chosen by a certian subject, in the set and the joint influences Z^ , which 
result in selection of alternative %, in the set Z^. 

Therefore, we should modify the schema of Influence Module in BCSAS as 
follows. We present elaborated schema, where sub-module "Solution: D/i" is ac- 
companied with sub-module "Solution: Z/i". Together these sub-modules are 
included into the "Solutions" sub-module. 

BCSAS is the fundamental schema of an abstract subject, which is used 
through out the RGT. The BCSAS is presented in Fig|4] 

This concludes the overview of RGT and description of tasks within the scope 
of the general theory. Therefore, we continue with application of the RGT to the 
mixed groups of humans and robots. 
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Fig. 4. The Basic Control Schema of an Abstract Subject (BSCAS). 
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5 Defining Robots in RGT 

As we have noted in the Introduction section, the goal of the robots in mixed 
groups of humans and robots is to refrain human subject from choosing risky 
actions, which might result in injuries or even threaten live. 

It is considered by default that robot follows the program of behavior. Such 
program consists of at least three modules. The Module 1 implements robot's 
ability of human-like decision-making based on the RGT. The Module 2 contains 
the rules, which refrain robot from making a harm to human beings. The Module 
3 predicts the choice of each human subject and suggests the possible reflexive 
control strategies. 

The Modules 1 and 3 are inhereted from the BCSAS of an Abstract Individ- 
ual. They correspond to Decision Module and Influence Module of the BCSAS 
(Fig.|4]), respectively. Therefore all the properties and meaning of outputs of the 
Modules 1 and 3 are the same as the ones for Decision and Influence modules, 
respectively. 

The Module 2 is the new module, which is intrinsic for robotic agents studied 
in the context of mixed groups of humans and robots. This module is responsible 
for extraction of only harmless or non-risky alternatives for human subject. 

We suggest to apply Asimov's Three Laws of robotics [TST , which formulate 
the basics of the Module 2: 

1) a robot may not injure a human being or, through inaction, allow a human 
being to come to harm; 

2) a robot must obey any orders given to it by human beings, except where such 
orders would conflict with the First Law; 

3) a robot must protect its own existence as long as such protection does not 
conflict with the First or Second Law. 

We consider that these laws are intrinsic part of robots " mind" , which cannot 
be erased or corrupted by any means. 

The interaction of Modules 1 and 2 is performed in the Interaction Module 
1. The interaction of Modules 3 and 2 is implements in the Interaction Module 
2. 

The Boolean algebra is filtered according to Asimov's laws in Module 2. 
The output of Module 2 is set U of approved alternatives. This data is then 
propagated into interaction modules. 

The output of the Module 1 is set D of alternatives, which robot has to choose 
under the given joint influences. In the Interaction Module 1, the conjunction of 
sets D and U is performed: D n U = DU. If set DU is not empty set, this means 
that there are aproved alternatives among the alternatives that robot should 
choose in accordance with the joint influences. Therefore, robot can implement 
any alternative from the set DU. If set DU is empty, this means that under given 
joint influences robot cannot choose any approved alternative, therefore robot 
will choose an alternative from set U. This is how the Interaction Module 1 
works. 

The output of the Module 3 contains sets D/i and . The goal of the robot 
is to refrain human subjects from choosing risky alternative. This can be done 
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Fig. 5. The Basic Control Schema of a Robotic Agent (BCSRA). 



by convincing human subjects to choose alternatives from the set U. First, we 
check whether D/i contains any approved alternative. We do so by performing 
conjunction of sets D?, and U: D/^ n U = D^iU. 

If set D;iU is not empty, then it means that it is possible to make a human 
subject to choose some non-risky alternative. Therefore, we should choose the 
corresponding reflexive control strategy from the set Z/j. However, if set D^iU 
is empty, we have to find the reflexive control strategy that will make human 
subject to select approved alternative from set U. For this purpose, we construct 
set Z[/ by including all the joint influences Z^^ for approved alternatives: Z-^ € 
^(7 "^^^ X £ U. Next we check whether set "Lu is empty. If set "Lu is empty 
this means it is impossible to convince a human subject to choose non-risky 
alternative. Therefore, the only option of reflexive control in this case is to put 
this subject into frustration state. However, if set is not empty, this means 
that there exist at least one reflexive control strategy that results in selection of 
alternative from the set of the approved (non-risky) ones. 

Therefore, the BCSRA inherits the entire structure of the BCSAS and aug- 
ments it with Module 2 of Asimov's Laws together with Interaction Modules 1 
and 2. 

The original schema of robot's control system has been recently presented 
in [20] ■ The BCSRA is extended version of the original schema. The BCSRA 
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provides comprehensive approach of how Forward and Inverse tasks are solved 
in the robot's "mind". 

Thus, in this section we have presented the formahzation of robotic agent in 
the RGT. We outhned the specific features of robotic agents, which distinguish 
them from other subjects. Furthermore, we provided detailed explanation of how 
the Forward and Inverse tasks are solved in the framrework of control system 
(BCSRA) of robots. 

Next, we proceed with consideration of sample sutiations of interactions be- 
tween humans and robots. 



6 Extended Sample Analysis of Mixed Groups 

Here we elaborate two examples, presented in the previous study [20], of how 
robots in the mixed groups can make humans refrain from risky actions. We 
discuss the application of the extended schema of robot's control system and 
provide explicit derivation of reflexive control strategies, which has been applied 
in these examples in the prevous study |20j . 

6.1 Robots Baby-Sitters 

Suppose robots have to play a part of baby-sitters by looking after the kids. We 
consider a mixed group of two kids and two robots. Each robot is looking after a 
particular kid. Having finished the game, kids are considering what to do next. 
They choose between "to compete climbing the high tree" (action a) and "to 
play with a ball" (action (3). Together actions a and /3 represent the active state 
l={a, f3} = {a} + {f3}. Therefore the Boolean algebra of alternatives consists of 
four elements: 1) the alternative {a} is to climb the tree; 2) the alternative {/3} 
is to play with a ball; 3) the alternative 1 = {a, (3} means that a kid is hesitating 
what to do; and 4) the alternative = {} means to take a rest. 

We consider that each kid considers his robot as ally and another kid and 
his robot as the competitors. The kids are subjects a and c, while robots are 
subjects b and d. The relationship graph is presented in Fig. |6] 



y N 



Fig. 6. The relationship graph for robots baby-sitters examples. 



Next we calculate the diagonal form and fold it in order to obtain decision 
equation for each subject: 
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[a][b] [cM 
[ab] +[cd] 
[ah + cd] — ab + cd . 

From two actions a and /?, action a is a risky action, since a kid can fall from 
the tree and this is real threat for his health or even life. Therefore according to 
Asimov's laws, robots cannot allow kids to start the competition. Thus, robots 
have to convince kids not to choose alternative {a}. In terms of alternatives, 
the Asimov's laws serve like filters which filter out the risky alternatives. The 
remaining alternatives are included into set U. In this case, U = {{/?}, {}}. 

Next we solve the Inverse taks, regarding alternatives {/?} and {}. We conduct 
the analysis regarding kid a. This analysis can be further extended for kid c in 
the similar manner. 

Solution of the Inverse task for kid a with approved alternatives as target 
choice. The decision equation for kid a is a = ab + cd. First, we transform it into 
canonical form; a = {b + cd)a + cda. 

Next we consider system of influence equations: 



b + cd = x (46) 
cd = X, (47) 



where alternative x S U. 

Regarding eq. ( 47 1 , eq. ( 46 1 is transformed into equation 



b + X = X (48) 

The solution of eq.([48| directly follows from Lemma [ij 0. Therefore 

for X = and x — {} the solutions are {/3} 3 6 3 and 6 = 0, respectively. 
The eq.(47l can be instantly solved according to Lemma[2j x^ + X c^ x- 



Consider x — {P} first. Then + {a}d 3 c D {/?}. By varying values of 
variable d, we obtain all the pairs (c, d): 

d = 1: {/?} D c D {/?} ^ c = {/?}. Therefore the solution is pair ({^}, 1); 

d = 0: {a} ^ c 3 {/3}. Since {a} n {/?} — {}, there is no solution; 

d —{a} : 3 c 3 {/3}. Since {/?} 3 {}, there is no solution; 

d —{P} : 1 ^ c 3 {/3}. Therefore there are two solutions (l,{/3}) and 

Therefore equation cd = {/?} has three solutions ({/?}, 1), (l,{/3}) and 

Thus, we have solved both equations from system ( 46p7 ). The solutions of 



this system are the triplets (6, c, d) of joint influences, which are all possible com- 
binations of solutions of both equations. Since there are two solution of eq.([46| 
and three solutions of eq.(|47]), there are six triplets (5, c, d) in total: (0, {/?}, 1) 
and ({/?}, {/?}, 1); (0, 1, {/3}) and ({/?}, 1, {/?}); (0, {/?}, {/3}) and ({/?}, {/?}, {/?}). 

Now we consider the case, when X — — {}■ Then d 13 c 3 0. We obtain 
pairs (c, d) for all values of variable d: 

d = 1: ll)cl50=>c = 0. Thus, there is only one solution (0,1); 
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d = 0: 1 D c D 0. Thus, there are four solutions (1,0), ({a},0), {{P},0) and 
(1,0); 

d ~ {a}: {/3} 3 c D 0. Thus, there are four solutions ({/?}, {a}) and (0, {a}); 
d = {/?}: {a} 2 d D 0. Thus, there are four solutions ({a}, {/?}) and (0, {/3}). 
In total, equation cd ~ has 9 solutions. Therefore system p9p0 | also has 



9 solutions as triplets ib,c,d): (0,1,0), (0,0,0), (0,0, {a}), (0,0, {/?}), (0,0,1), 
(0,M, {/?}), (0,{a},0), (0,{/?},{a}) and (0,{/3},0). 

We have considered two cases, when both upper and lower bounds of the 
interval of decision equation equal to the same alternative. Now we discuss a 
new situation, when variable a should take not a single value, but several values. 
In this case, we should find the joint influences (6, c, d) that result in selection 
of either alternative {/3} or {}. Since, {/?} 3 {}, we need to find all the triplets 
(6, c, d), resulting in the solution of decision equation as interval {/3} 3 a 3 {}. 
Thus, {/?} Da*D {}. 

Therefore, we need to solve the following system of equations: 

b + cd = {/?} (49) 
cd = 0. (50) 



The eq.(49l turns into equality b — {/?}, and we need to solve eq.([50|. How- 
ever, this equation has been already solved in the previous example. Therefore we 
obtian the solutions of the system ( [igpO] ): ({/?}, 1, 0), ({/?}, 0, 0), {{P}, 0, {a}). 



m, 0, {/?}), ({/?}, 0, 1), m, {«}, {/?}), ({/?}, {a}, 0), m, {«» and 
({/?}, {/3},0). 

Comparing solutions of all three system of influence equation, we can see 
that there are four remarkable solutions ({/?}, {/?}) and ({/?},{}, {/?}); 
({/?}, 1, {/?}) and ({/?}, {a}, {/?}). The first pair of solution results in choice of 
only alternative {/?}, while second pair of solutions results in selection of eighter 
alternative {/?} or alternative {}. These four solutions together illustrate that 
if 6 = d = {/?}, it is guaranteed that regardless of influence of kid c, kid a will 
choose either of approved alternatives. 

By analogy, we can see that among solutions of system ( 46p7 ) with x = {}, 



there are four solutions (0, 1, 0),(0, 0, 0), (0,{a},0) and (0,{/3},0). Therefore, if 
b = d = 0, kid a will choose alternative = {} regardless of influence of kid c. 

These two examples of binding variables b and d were considered in Scenario 1 
and Scenario 2 of sample situation with robot baby-sitters, originally presented 
in [2D]. 

Summarizing the results of this section, we have shown that robots can suc- 
cessfully control kids' behavior by refraining them from doing risky actions. The 
basic of this control is entirely based on the proposed schema of robot's control 
system. We have analyzed all the possible reflexive control strategies by solving 
three systems of influence equation: two systems regarding a single alternative 
and one system regarding the interval of alternatives. Therefore, we have shown 
how the Inverse task can be effectively solved by our proposed algorithm in 
situation similar to the real conditions. 
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6.2 Mountain-Climbers and Rescue Robot 

We consider that there are two cUmbers in the mountain and rescue robot. The 
chmbers and robot are communicating via radio. One of the chmbers (subject b) 
got into difficult situation and needs help. Suggest, he fell into the rift because 
the edge of the rift was covered with ice. The rift is not too deep and there is a 
thick layer of snow on the bottom, therefore climber is not hurt, but he cannot 
get out of the rift himself. The second climber (subject a) wants to rescue his 
friend himself (action a), which is risky action. The second option is that robot 
will perform rescue mission (action Since inaction is inappropriate solution 
according to the First Law, the set U of approved alternatives for robot includes 
only alternative {13} . The goal of the robot is to refrain the climber a from 
choosing alernative {a} and perform rescue mission itself. 

We suggest that from the beginning all subjects are in alliance. The cor- 
responding graph is presented in Fig. [ij: and its polynomial is abc. Therefore 
by definition it is homogenous group and, consequently, it is super-active group 
according to Lemma [3j 

Thus, any subject in the group is in active state. Therefore, group is un- 
controllable (see Section [sTs] ) . In this case, robot makes decision to change his 
relationship with the climber b from alliance to conflict. Robot can do that, for 
instance, by not responding to climber's orders. 

Which reflexive control leads to frustration state ? Then the polynomial corre- 
sponding to the new group is a(6-f c). This polynomial has been already broadly 



discussed in the Section 3.2 Therefore, we know decision equation for subject a: 



a — {b+c)a+a. We have shown as well that subject a can choose only alternative 



1 = {a,/3}, if appropriate joint influences are applied (see Section 3.2 1, overwise 
subject a is in frustration state and cannot make any choice. Therefore, in or- 
der to put subject a into frustration state, the reflexive control strategy should 



NOT be selected from the list of solutions (Section 3.2): ({/3},{a}); (l,{a}); 
({a},{/3}); (l,{/3}); (0,1); ({«},!); ({/3},1); (1,1) and (1,0). 

Here we provide two examples of such joint influences (6,c): ({q!},{q}) ^ 
({a} + {a}) = {«} C 1 and ({/?}, {}) => ({/?} + {}) = {/3} C 1. 

Whether robot can complete mission regardless of joint influences of other 
subjects? The decision equation for robot c is c — c + {b + a)c. The corresponding 
solution interval is 1 3 c 3 (& -f a). 

Here we analyze all 16 possible reflexive control strategies (a, b) that climbers 
can apply to robot c. 

Examples with emtpy set DU. For (0,6), there will be the same situation 
regardless of value of variable 6:113c3(fe + 0)=>13c3(6 + l)=>c=l. 

For (a, 1), there will be the same situation regardless of value of variable a : 
12 c2 (l + a) ^ c^ 1. 

For {{a}, {a}): IDcD {{a} + {a}) IDcD {{a} + {f3}) ^c^l. 

For ({/3},{/3}): IDcD ({/3} + {/?}) ^IDcD ({/?} + {a}) ^ c = 1. 
Therefore in these cases set D = {{q:,/^?}}. 

Next we consider other pairs (a, b). 
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(1, {a}): 1 ^ c D {{a} + T)^l 2 c D {a}. Here set D = {{a, /?}, {a}}. 
({/?}> {a}): 1 3 c D + {^}) ^ 1 3 c D {a}. Here set D = {{a, /?}, {a}}. 
({/?}, 0): 1 3 cD (0 + {/?}) ^IDcD {a}. Therefore, set D = {{a, {a}}. 
Since U = {{/?}}, DU = {} for all the cases considered above, robot will 
choose alternative {/?} from the set U. 

Examples with non-empty set DU. Consider the following pairs (a, 6): 
(!,{/?}): 1 3 cD ({/3}+T) ^12cD {f3}. Therefore, set D = {{a, l3}/{P}}. 
(1, 0): 1 D c D (0 + T) ^ 12_c 2 0. Thus, set D = {{a, /?}, {a}, {/?}, {}}. 
({a}, {/3}): 1 D c D ({/3} + W) ^ 1 ^ c D {/?}. Thus, set D = {{a, /3}, 
({a}, {/?}): 1 3 c D ({^{a}) ^ 1 3 c D {/?}. Thus, set D = {{a, /3}, {/?}}. 
({a}, 0): 1 D c D (0 + {a}) ^ 1 3 c D {^j. Thus, set D = {{a, /3}, {/?}}. 
Since U — {{/?}}, DU = {{/?}} for all the cases considered above, robot will 
choose alternative {/?} from the set DU. 

Thus, we have shown that under all 16 reflexive control strategies (a, fe), robot 
c can choose the alternative {/?}, which is to perform the rescue mission itself. 
Therefore robot will choose alternative {/3} regardless of the joint influences 
(a, b) of the climbers. 

The discussed example illustrates how robot can transform uncontrollable 
group into controllable one by manipulating the relationships in the group. In 
the controllable group by its influence on the human subjects, robot can refrain 
the climber a from risky action to rescue climber b. Robot achieves its goal by 
putting climber a into frustration state, in which climber a cannot make any 
decision. On the other hand, set U of approved alternatives guarantees that 
robot itself will choose the option with no risk for humans and implement it 
regardless of climber's influence. 

Therefore, in this section we have illustrated robot's ability to refrain human 
being from risky actions and to perform these risky actions itself. This proves 
that our approach achieves both goals of robotic agent: 1) to refrain people 
from risky actions and 2) to perform risky actions itself regardless of human's 
influences. 



7 Discussion and Conclusion 

Summarizing, the results of this paper, we outline the most important of them. 

First of all, we have introduced the Inverse task and developed the ultimate 
methods to solve it. 

We have provided a comprehensive tutorial to the brand new Reflexive 
Game Theory recently formulated and proposed by Vladimir Lefebvre [TJ [21 
[3J 13] . The tutoral contains the detailed description of the Forward and Inverse 
tasks together with methods to solve them. 

We propose control schemas for both abstract subject (BCSAS) and robotic 
agent (BCSRA). These schemas were specially designed to incorporate solution 
of the Forward and Inverse tasks, thus providing us with autonomous units 
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(individuals, subjects, agents) capable of making decisions in the human-like 
manner. We have shown that robotic agents based on BCSRA can be easily 
included into the mixed groups of humans and robots and effectively serve their 
fundamental goals (refraining humans from risky actions and, if needed, perform 
the risky acions itself). 

Therefore, we consider that present study provides the comprehensive overview 
of the classic RGT proposed by Vladimir Lefebvre [UEllSlll] and newly developed 
self-consistent framework for analysis of different kinds of groups and societies, 
including human social groups and mixed groups of humans and robots together 
with application tutorial of this new framework. 

This framework is entirely based on the principles of the RGT and brings 
together all its elements. The solution of the Inverse task, presented in this 
paper, plays a crutial role in formation of this framework. Therefore, by having 
the Inverse task as one of its fundamentals, this framework illustrates the role 
of the Inverse task and its relationship with other issues considered in the RGT. 
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Appendix 

A When sets A and B eire functions of less than total 
number of subject minus one VEiriables 

Consider groups of four subjects a, 6, c and d. Suggest the polynomial corre- 
sponding to this group is 6(a -\- d) -\- c. Next we construct diagonal form and 
perform folding operation: 

[a] + {d\ 

[6][a + d] 
\h{a + d)\ +[c] 

[6(a -I- rf) c] 



[a + rf] + [a] + [d]) 
\h{a^d)\ +[c] 

\h{a -I- d) c] 

[&] 

[6(o + d)\ +[c] 

[5(a + rf) + c] = 



= 6(a + d) + c + fe(a + d) + & + c 
Next wc simphfy the resultant expression of diagonal form folding: 



6(a d) -t- c -t- 6(a d) 6 c = 6(a rf) + c + 6(a 4- d)cb = 
6(a d) -I- c6 c6 -h h{a d)ch = b{{a + d) + c + b{a + d)c) + db = 
b{{a + d)c +{a + d)c + c+(b+{a + d))c) + cb = 
b{{a + d)c + {a + d)c + c + bc+ {a + d)c) + d) = 
b{c +{a + d)c + ((a + d) + (a + d))c) + cb = b{{a + d)c + c + c) + cb = 
b{{a + d)c + l) + d) = b + cb = b + c 
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Consequently, 



[b{a + d)] + [b] + [c] 



[b{a + d) + c] 



= b + c 



Therefore, the decision equation includes only two subject variables instead 
of four. Consequenly, for subjects a and d the decision equations in canonical 
forms are 



Thus, the sets A and B for subjects a and d are equal. The sets A and B are 
functions of only variables b and c: A = A{b, c) = b + cb and B = B(b,c) = b + cb. 
The canonical forms of decision equations for subjects b and c are: 



Therefore, set A = 1 for both subjects. Set B is a functions of a single 
variable: B{c) = c and B{b) = b for subjects b and c, respectively. 

B Example of non-homogenous super-active groups 

Here we provide an example of non-homogenous super- active group. 

Consider the group of four subject a, b, c and d, which is described by poly- 
nomial c{ab + b). Let us build the diagonal form and perform its folding: 



a = (6 + c)a + (6 + c)a 
d = {b + c)d -\- {b + c)d 



(51) 
(52) 



b = b + cb 
c = c + bc 



(53) 
(54) 



[a][b] 



[ab] 



+ [d] 



[c] [ab + d] 



[c{ab + d)] 



{[ab] + [a][b]) + [d] 



[c][ab + d] 



= [c{ab + d)] 



1 



[c] [ab + d] 



= [c{ab + d)] 



[c{ab + d)] + [c][ab + d] = 1 □ 



